Accession Number | TCMCG009C21960 |
gbkey | CDS |
Protein Id | XP_030480018.1 |
Location | complement(join(33969785..33969823,33969931..33970215,33970419..33970536,33976676..33976737,33976829..33976954,33977253..33977453,33977595..33977825,33978816..33978970,33979111..33979204,33979588..33979810,33980449..33980522,33980872..33980963,33981523..33982257,33982884..33983366,33983589..33983688,33984472..33984645,33984737..33984940)) |
Gene | LOC115697235 |
GeneID | 115697235 |
Organism | Cannabis sativa |
Length | 1131aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA560384 |
db_source | XM_030624158.1 |
Definition | peroxisome biogenesis protein 1 isoform X1 [Cannabis sativa] |
CDS: ATGGATTTTGATGTCAAGTTGGTCGCTGGTATGGAGAGCTGCTTCGTGTCCTTGCCTCTCTTCCTAATCCAAACCCTCCAATCCTCCTCCTCCTCCGGTTACCTCCCGGAAGTTCTCGCTCTCGACCTACGCTCCCGTACCTCAGACGATGACCACTGGACCATAGCCTGGTCTGGCGCCACTTCGTCTTCTTCCTCAATCGAGATTGCTCAGCAGTTTGCGGATTGCATATCTCTAAGAGAGGGTACACGAGTCACAGTCCGAGCTCTGCCAAATGTGGCCAAAGCTACTTTGGTAACCATTGAACCAAATACTGAGGATGATTGGGAAGTTATGGAGCTCAACGCAGAGCTTGCAGAGGCAGCTATATTGAACCAGGTTAGGATAGTTCAGGAAAAGATGACGTTTCCTTTATGGTTGGGTGGCCGTACTATCATCATGTTTTGTGTGGTTTCAACTTTTCCCAAGGAAGCGGTGGTGCAACTTGTGCCAGGAACAGAAGTTGCAGTTGCTCCAAAGAGACGCAGGAAAAATTTAGACTCAAGTCATGATTCTTCCTTGTTATCTTCTAATAAAGCACGTCATACTGCAATGGCTCTACTCCGTGTTCAAGATGGAAACAGGAGACTAGTTCACAAAAGTTTTGTCGAAAGCATTGAGCTTGATGTAGTCCTCACTTCAGTGGCTATGGTACATCCAGAAACAGCAAAAATGTATGAACTTGATAGTCTTCAGTCCGTAATTTTAGTGCCTCGGTTATCATCAAAGGATAGCGTAAAAGATTCTGAAAAGAATGGAATGACAGTAAAGAGCAATCTTGCTTCAAAGGATGCTAAAATTGCAATTAAACAGGAACATCGTCAAGCAGTTGTGCGCATATTGGTTTCAGATTCGGTTGCCAAAGGGCATGTGATGATCGCCCACTCTCTTCGTCTTTACTTGAGGGCTGGCCTGCATTCATGGGTTTATTTAAAGCGATGCAGTCAATTGCAAAAAGATATTCCCTCACTTTCACTTTCTCCTTGTCATTTTAAGGTAGAAAAAATTAAGCATTCAGAGAAGAATGATTTTGAAGTGCTTGATAACCAAAAAAACCGCAGAACAAAAAATTTGCATCTCAATACTAGTTCAGTAGCTTATATGAATGTTGTAGACTGGGCAACCCATGAGGAAGTTGTTGCTGCTCTTTCACATGAATCTCATTGCAAAGAGGATGAGAAGGGTCCTTGTAAGGATGAAAGTGCTAAGGGTCTAGAAAATCTTGTTAAAGCATGGTTCAGTGCGCAAGTTGATTCCATTTCATCGACTTCAGGAGTAAAAGTTACTTCACTAATTCTGGGAAGTGAAACATTGGTTCACTTTGAAGTGAAAGGCTACAAGTTTGGGTCACATAAAAATACTATGATGTCATCTAATGATTTTCTAGAGAACATAAATAAGCCTAGTAAACTGCCAGTTGAAATATTATATGTATTGACTATTCCTGAGGACTCCCATTTAGGCGGAAGTGCTTATGAGCTGGTTTTTGATGAAATAAATGAAGGGAACAATAATGATCTGCAAGGTGCATTGTCTAACAGGATTGGTGATCCTGTAACCTTTAAATGTGTCAGAGAGAGAATATTTGATGAAGATATAAGGACTGACATATCTTCTTTGGGTTGGATGGGGACAAGTGCTTCAGACGTTACAAATAGAATGATGATATTGCTATCCTCTACTTCAAGCATGTGGTTCAGTTCATACAATCTTCCTCTTCCAGGACATGTTCTAATATATGGACCTTCAGGGTCAGGGAAGACATTATTAGCAAAAGCTGTTGCCAAATTTCTTCAGGAACAAGAAGACTTCTTAACATACATTGTATTTGTATCTTGCTCTAAACTTGCTGTGGAGAAGGCCCAAACCATTCGTCAAACACTCTCTGGCTATATATCAGAGGCTTTAGATCATGCACCATCTCTTGTTATCCTTGATGATCTTGACTCTATTATT
TCCTCTACTTCTGACTCAGAGGGATCTCAAATTTCAAGCTCTGTAACTGCACTGATAGAATTTCTGACAGATATTATGGATGAATATGGGGAGAAGGGAAATATTGCTTGCGGAATTGGCCCCCTTGCCTTCATAGCTTCTGTCAAGTCTTTGGAGGGACTACCTCAGTCATTGACTTCTTCAGGAAGGTTTGACTATCATGTTCAAATGCCTGCTCCTGCTGCCTCAGAACGGGCAGCCATACTGAAGCATGAAATTCAGAAGCGTTGCTTGCAATGTCCTGAAAGCATCTTACAAGATGTAGCTTCGAAATGTGATGGCTATGATGCATACGATCTGGAAATATTGGTTGATAGAACTGTTCATGCTGCCATTGGTAGATTTCTGTCTTGCCATTCTTCTTTGGACCAATGTGAAAAGCCCACTTTACTAAGGGATGATTTTTCTCGGGCAATGCACGATTTCCTTCCAGTAGCAATGAGGGAGGTTACTAAATCTGCTCCAGAAAGTGGTCGGTCTGGGTGGGATGATGTTGGTGGTCTTCTTGAGATTCAGAAGGGTATTAAAGAGATGATTGAATTGCCGTCCAAGTTCCCAGACATATTTGCACATGCACCATTAAGATTACGATCAAATGTTCTTTTATATGGACCTCCGGGTTGTGGGAAGACACACATAGTTGGTGCTGCCGCTGCTGCCTGTTCACTACGATTTATATCTGTCAAAGGGCCTGAGTTGCTGAACAAGTATATTGGTGCTTCTGAGCAAGCTGTTCGTGACATATTCTCCAAAGCTGCTGCTGCAGCCCCATGCCTTCTATTTTTTGATGAATTTGATTCTATTGCCCCCAAAAGAGGACATGACAATACTGGAGTAACAGACCGCGTTGTTAATCAATTTCTTACTGAATTAGATGGTGTTGAAGTTTTGACAGGGGTATTTGTTTTTGCGGCAACAAGTAGACCAGATTTACTTGATGCTGCACTTTTAAGACCTGGTAGGCTAGATCGCCTTCTTTTCTGTGATTTTCCATCTCAAGGTGAAAGGCTGGAGATTCTAAGTGTTCTCTCTAGAAAGCTACCACTCTCCAGTGATGTTGATTTAGATGCCATAGCTCATATGACTGAAGGTTTTAGTGGAGCTGACCTTCAAGCATTGCTCTCAGATGCACAGCTTGAAGCGGTTCATGACCTTCTGGGTGGTGAAAGCATGCATGAACCAGGGGAAAAGCCAGTAATTACTGACGCTCTCTTAAAATCCACTGCCTCCAGGGCAAGACCATCTGTTTCGGAAGCAGAAAAGCAACGCCTCTTCGGAATTTACAGCCAGTTTTTGGATTCCAAGAGATCTCTTGCTGCACAGTCGAGAAATGCAAAAGGCAAGAGAGCAACATTAGCGTAA |
Protein: MDFDVKLVAGMESCFVSLPLFLIQTLQSSSSSGYLPEVLALDLRSRTSDDDHWTIAWSGATSSSSSIEIAQQFADCISLREGTRVTVRALPNVAKATLVTIEPNTEDDWEVMELNAELAEAAILNQVRIVQEKMTFPLWLGGRTIIMFCVVSTFPKEAVVQLVPGTEVAVAPKRRRKNLDSSHDSSLLSSNKARHTAMALLRVQDGNRRLVHKSFVESIELDVVLTSVAMVHPETAKMYELDSLQSVILVPRLSSKDSVKDSEKNGMTVKSNLASKDAKIAIKQEHRQAVVRILVSDSVAKGHVMIAHSLRLYLRAGLHSWVYLKRCSQLQKDIPSLSLSPCHFKVEKIKHSEKNDFEVLDNQKNRRTKNLHLNTSSVAYMNVVDWATHEEVVAALSHESHCKEDEKGPCKDESAKGLENLVKAWFSAQVDSISSTSGVKVTSLILGSETLVHFEVKGYKFGSHKNTMMSSNDFLENINKPSKLPVEILYVLTIPEDSHLGGSAYELVFDEINEGNNNDLQGALSNRIGDPVTFKCVRERIFDEDIRTDISSLGWMGTSASDVTNRMMILLSSTSSMWFSSYNLPLPGHVLIYGPSGSGKTLLAKAVAKFLQEQEDFLTYIVFVSCSKLAVEKAQTIRQTLSGYISEALDHAPSLVILDDLDSIISSTSDSEGSQISSSVTALIEFLTDIMDEYGEKGNIACGIGPLAFIASVKSLEGLPQSLTSSGRFDYHVQMPAPAASERAAILKHEIQKRCLQCPESILQDVASKCDGYDAYDLEILVDRTVHAAIGRFLSCHSSLDQCEKPTLLRDDFSRAMHDFLPVAMREVTKSAPESGRSGWDDVGGLLEIQKGIKEMIELPSKFPDIFAHAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKAAAAAPCLLFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSQGERLEILSVLSRKLPLSSDVDLDAIAHMTEGFSGADLQALLSDAQLEAVHDLLGGESMHEPGEKPVITDALLKSTASRARPSVSEAEKQRLFGIYSQFLDSKRSLAAQSRNAKGKRATLA |
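
The Location field in this record uses GenBank-style feature-location syntax: `complement(...)` indicates the reverse strand, and `join(...)` lists the exon intervals as inclusive 1-based coordinates. As a minimal sketch of how that field can be cross-checked against the Length field, the snippet below sums the interval lengths and compares the result with the 1131 aa protein plus one stop codon. The helper names `parse_location` and `cds_length` are hypothetical, introduced only for illustration, and the parser assumes plain `start..end` intervals with no partial-boundary markers such as `<` or `>`.

```python
import re

# Location string copied from the record (GenBank feature-location syntax).
LOCATION = (
    "complement(join(33969785..33969823,33969931..33970215,"
    "33970419..33970536,33976676..33976737,33976829..33976954,"
    "33977253..33977453,33977595..33977825,33978816..33978970,"
    "33979111..33979204,33979588..33979810,33980449..33980522,"
    "33980872..33980963,33981523..33982257,33982884..33983366,"
    "33983589..33983688,33984472..33984645,33984737..33984940))"
)

PROTEIN_LENGTH_AA = 1131  # from the Length field of the record


def parse_location(location):
    """Return (strand, segments) for a simple complement(join(...)) location.

    Hypothetical helper: handles only plain integer 'start..end' intervals.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def cds_length(segments):
    """Sum the lengths of inclusive, 1-based intervals."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total_nt = cds_length(segments)

# A complete CDS should be a whole number of codons, and the codon count
# should equal the protein length plus one stop codon.
is_whole_codons = total_nt % 3 == 0
matches_protein = total_nt // 3 == PROTEIN_LENGTH_AA + 1

print(strand, len(segments), total_nt, is_whole_codons, matches_protein)
```

For this record the check is consistent: the 17 intervals sum to 3396 nt, which is 1132 codons, matching 1131 amino acids followed by the terminal stop codon (`TAA`) at the end of the CDS.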